Efficient codebooks for fast and accurate low resource ASR systems
نویسندگان
چکیده
منابع مشابه
Efficient codebooks for fast and accurate low resource ASR systems
Today, speech interfaces have become widely employed in mobile devices, thus recognition speed and resource consumption are becoming new metrics of Automatic Speech Recognition (ASR) performance. For ASR systems using continuous Hidden Markov Models (HMMs), the computation of the state likelihood is one of the most time consuming parts. In this paper, we propose novel multi-level Gaussian selec...
متن کاملEfficient codebook for fast and accurate low resource ASR systems
Nowadays, speech interfaces have become widely employed in mobile devices, thus recognition speed and power consumption are becoming new metrics of Automatic Speech Recognition (ASR) performance. For ASR systems using continuous Hidden Markov Models (HMMs), the computation of the state likelihood is one of the most time consuming parts. Hence, we propose in this paper novel multi-level Gaussian...
متن کاملEfficient Harvesting of Internet Audio for Resource-Scarce ASR
Spoken recordings that have been transcribed for human reading (e.g. as captions for audiovisual material, or to provide alternative modes of access to recordings) are widely available in many languages. Such recordings and transcriptions have proven to be a valuable source of ASR data in well-resourced languages, but have not been exploited to a significant extent in under-resourced languages ...
متن کاملLow complexity techniques for embedded ASR systems
This paper deals with the problem of reducing the computational complexity of ASR algorithms for embedded systems. Particularly, three methods for simplifying the computation of state observation likelihoods of continuous density based HMMs are proposed. Feature component masking, variable-rate partial likelihood update and density pruning all result in significant savings in the decoding compl...
متن کاملA Low-Resource ASR Back-End Based on Custom Arithmetic
Most contemporary ASR systems running on desktops use continuous-density HMMs (CHMM) with floating-point representations. It is important to reduce their memory and power requirements so that they can be more affordable for portable devices. In this paper, we propose a novel speech recognition back-end based on custom arithmetic, where all floating-point variables are represented by integer ind...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Speech Communication
سال: 2009
ISSN: 0167-6393
DOI: 10.1016/j.specom.2009.01.010